Variance Estimation for Finite Populations with Imputed Data

Authors

  • Philip Steel
  • Robert E. Fay
Abstract

One way of handling survey nonresponse is to impute data for each nonrespondent. When estimating sampling variances, however, treating the imputed data as a complete set frequently leads to underestimates of the true sampling variance. Techniques have been recently developed to yield valid variance estimates in the presence of imputed data for some estimators and sample designs. Economic surveys frequently deal with highly skewed populations and employ high sampling rates, including selection with certainty for large units. It is also common for economic surveys to have administrative or historical data available for use in imputation. This paper describes a Monte Carlo study of variance estimation on skewed populations with high sampling fractions in some strata. We examine a variety of imputation techniques and patterns of nonresponse. We extend the Rao-Shao technique to finite populations and nearest neighbor imputation, and compare the resulting estimators to the true variance.
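The underestimation described in the abstract can be illustrated with a small Monte Carlo sketch. This is not the study design of the paper: the lognormal population, the sizes N and n, the uniform response rate, and the use of mean imputation are all assumptions chosen for illustration, and the function name `simulate` is hypothetical. The sketch compares the Monte Carlo variance of the imputed-data sample mean with the average of the naive variance estimate that treats the completed data set as fully observed.

```python
import random


def mean(values):
    return sum(values) / len(values)


def sample_variance(values):
    # Unbiased sample variance (divisor len - 1).
    m = mean(values)
    return sum((v - m) ** 2 for v in values) / (len(values) - 1)


def simulate(N=2000, n=200, response_rate=0.7, reps=1000, seed=12345):
    """Return (Monte Carlo variance, average naive variance estimate).

    All settings are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    # Assumed skewed finite population, held fixed across replications.
    population = [rng.lognormvariate(0.0, 1.0) for _ in range(N)]

    estimates = []
    naive_estimates = []
    for _ in range(reps):
        # Simple random sample without replacement.
        sample = rng.sample(population, n)
        # Uniform nonresponse: each unit responds independently.
        respondents = [y for y in sample if rng.random() < response_rate]
        if len(respondents) < 2:
            continue
        # Mean imputation: every nonrespondent gets the respondent mean.
        fill = mean(respondents)
        completed = respondents + [fill] * (n - len(respondents))
        estimates.append(mean(completed))
        # Naive estimator: treats imputed values as observed data and
        # applies the usual finite population correction.
        naive_estimates.append((1 - n / N) * sample_variance(completed) / n)

    # Variance of the estimator across replications (the simulation "truth").
    grand = mean(estimates)
    mc_variance = sum((e - grand) ** 2 for e in estimates) / len(estimates)
    return mc_variance, mean(naive_estimates)


if __name__ == "__main__":
    mc_variance, naive = simulate()
    print("Monte Carlo variance of the imputed mean:", mc_variance)
    print("Average naive variance estimate:", naive)
```

Under these assumptions the naive estimate comes out well below the Monte Carlo variance, because mean imputation shrinks the spread of the completed data while the effective sample size is only the number of respondents. Adjusted methods such as the Rao-Shao technique mentioned in the abstract are designed to correct this, and are not implemented here.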

Similar resources

Variance Estimation for Imputed Survey Data with Non-negligible Sampling Fractions

We consider variance estimation for Horvitz-Thompson type estimated totals based on survey data with imputed nonrespondents and with non-negligible sampling fractions. A method based on a variance decomposition is proposed. Our method can be applied to complicated situations where a composite of some deterministic and/or random imputation methods is used, including using imputed data to impute....

Imputation of Missing Data for the Pre-Elementary Education Longitudinal Study

In the Pre-Elementary Education Longitudinal Study (PEELS), imputation of item missing data was done using AutoImpute (AI) software, which uses semi-parametric modeling to form imputation classes. In this paper, we summarize PEELS experience with AI, investigate the bias aspect of the imputed data for the PEELS teacher questionnaire data, and study the variance estimation of imputed data using ...

Balanced Repeated Replication for Stratified Multistage Survey Data under Imputation

Balanced repeated replication (BRR) is a popular method for variance estimation in surveys. The standard BRR method works by first creating a set of "balanced" pseudo-replicated data sets from the original data set. For a survey estimator $\hat{\theta}$, the BRR variance estimator is the average of squared deviations $\hat{\theta}^{(r)} - \hat{\theta}$, where $\hat{\theta}^{(r)}$ is the same as $\hat{\theta}$ but based on the data in the rth pseudo-replicated ...

Fractional hot deck imputation

To compensate for item nonresponse, hot deck imputation procedures replace missing values with values that occur in the sample. Fractional hot deck imputation replaces each missing observation with a set of imputed values and assigns a weight to each imputed value. Under the model in which observations in an imputation cell are independently and identically distributed, fractional hot deck impu...

Selection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets

Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...

Publication date: 1995